Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher.
Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?
Some links on this page may take you to non-federal websites. Their policies may differ from this site.
-
Free, publicly-accessible full text available December 1, 2026
-
Free, publicly-accessible full text available June 1, 2026
-
Enhancing accurate molecular property predic- tion relies on effective and proficient representa- tion learning. It is crucial to incorporate diverse molecular relationships characterized by multi- similarity (self-similarity and relative similarities) (Wang et al., 2019) between molecules. However, current molecular representation learning meth- ods fall short in exploring multi-similarity and of- ten underestimate the complexity of relationships between molecules. Additionally, previous multi- similarity approaches require the specification of positive and negative pairs to attribute distinct pre- defined weights to different relative similarities, which can introduce potential bias. In this work, we introduce Graph Multi-Similarity Learning for Molecular Property Prediction (GraphMSL) framework, along with a novel approach to for- mulate a generalized multi-similarity metric with- out the need to define positive and negative pairs. In each of the chemical modality spaces (e.g., molecular depiction image, fingerprint, NMR, and SMILES) under consideration, we first de- fine a self-similarity metric (i.e., similarity be- tween an anchor molecule and another molecule), and then transform it into a generalized multi- similarity metric for the anchor through a pair weighting function. GraphMSL validates the effi- cacy of the multi-similarity metric across Molecu- leNet datasets. Furthermore, these metrics of all modalities are integrated into a multimodal multi-similarity metric, which showcases the po- tential to improve the performance. Moreover, the focus of the model can be redirected or cus- tomized by altering the fusion function. Last but not least, GraphMSL proves effective in drug dis- covery evaluations through post-hoc analyses of the learnt representations.more » « less
An official website of the United States government

Full Text Available